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A Conserved Target Site in HIV-1 Gag RNA is Accessible 
to Inliibition by Both an HDV Ribozyme and a Sliort 
Hairpin RNA 

Robert J Scarborough^ ^ Michel V Levesque^ Etienne Boudrias-Dalle^ ^ Ian C Chute^ Sylvanne M Daniels^ ^ Rodney J Ouellette^ 
Jean-Pierre PerreaulP and Anne GatignoP ^-^ 

Antisense-based molecules targeting HIV-1 RNA have the potential to be used as part of gene or drug therapy to treat HIV-1 
infection. In this study, HIV-1 RNA was screened to identify more conserved and accessible target sites for ribozymes based on 
the hepatitis delta virus motif. Using a quantitative screen for effects on HIV-1 production, we identified a ribozyme targeting 
a highly conserved site in the Gag coding sequence with improved inhibitory potential compared to our previously described 
candidates targeting the overlapping Tat/Rev coding sequence. We also demonstrate that this target site is highly accessible 
to short hairpin directed RNA interference, suggesting that it may be available for the binding of antisense RNAs with different 
modes of action. We provide evidence that this target site is structurally conserved in diverse viral strains and that it is 
sufficiently different from the human transcriptome to limit off-target effects from antisense therapies. We also show that the 
modified hepatitis delta virus ribozyme is more sensitive to a mismatch in its target site compared to the short hairpin RNA. 
Overall, our results validate the potential of a new target site in HIV-1 RNA to be used for the development of antisense therapies. 
MolecularTherapy— Nucleic Acids (2014) 3, e178; doi:10.1038/mtna.2014.31; published online 29 July 2014 
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Introduction 

Over 30 snnall nnolecules are available for the treatnnent of 
Human Innnnunodeficiency Virus 1 (HIV-1) infection, targeting 
the viral enzymes reverse transcriptase (RT), protease and 
integrase, as well as the cellular entry coreceptor, CCR5J 
Although treatment of HIV-1 with combination small molecule 
therapy is effective in preventing acquired immune deficiency 
syndrome, it cannot eradicate the virus and is associated with 
a number of short- and long-term side effects.^ Alternative 
therapeutic strategies for long-term viral suppression with 
low adverse effects are needed and small RNAs represent 
a growing class of molecules with the potential to comple- 
ment or replace current therapies. They are being evaluated 
for use in ex vivo gene therapy^ and with advances that have 
been made in their systemic delivery,^ may soon be evalu- 
ated for use in combination drug therapy. Many small RNAs, 
including antisense oligonucleotides (ASONs), ribozymes 
(Rzs), decoys, aptamers, small nuclear (sn) RNAs, and 
small interfering (si) or short hairpin (sh) RNAs have been 
designed with diverse target sites in the HIV-1 replication 
cycle. ^ Antisense-based RNAs (ASONs, Rzs, snRNAs, sh/ 
si RNAs) can be designed to target HIV-1 RNA, and several 
therapeutic candidates have been described. 

Rzs targeting HIV-1 RNA have been made by modifying 
hammerhead, hairpin^ and bacterial RNase motifs. The 
HDV Rz represents an alternative small Rz motif, that has 
evolved to function in human cells and has the potential to be 
used for the development of therapeutic Rzs.^ To improve the 
specificity of the HDV Rz for its target RNA, the SOFA (Specific 



On/oFf Adaptor) module was engineered^'^° (Figure la). Sev- 
eral SOFA-HDV-Rzs have been identified with the potential 
to target human, ^^'^^ viraP'^^'^^ and bacteriaP^ RNAs, including 
three Rzs that we have evaluated targeting the overlapping 
Tat/Rev coding sequence of HIV-1 RNA.^^ 

Optimal hammerhead Rz target sites in HIV-1 RNA have 
been identified using libraries of Rzs with randomized bind- 
ing arms^^'^^ and a library of Rzs targeting highly conserved 
sequences. Using different methods and datasets to esti- 
mate sequence conservation, sets of optimal siRNAs^^ or 
shRNAs^^'^^ have been identified and two of these studies 
have reported their conservation estimates in 19-21 nucle- 
otide (nt) frames.2°'22 Estimates have also been reported at 
the nt level to identify or characterize Rz,^^ snRNA,^'^ and 
shRNA^^ target sites. In this study, we generated conserva- 
tion estimates at the nt level to identify a set of optimal SOFA- 
HDV-Rz target sites from the beginning of the 5' untranslated 
region (UTR) to the end of the Gag coding sequence 
(Figure lb). Out of 18 new SOFA-HDV-Rzs, one Rz target- 
ing a novel site in the Gag coding sequence was particularly 
effective at reducing viral production. An shRNA targeting the 
same site was shown to be an extremely potent inhibitor of 
viral production and evidence that the identified target site is 
accessible to inhibition in diverse HIV-1 strains is provided. 

Results 

Identification of SOFA-HDV-Rz target sites in HIV-1 RNA 

HIV-1 sequence conservation was estimated to identify target 
sites that are relevant for the majority of HIV-1 strains. Esti- 
mates at the nt level were made (Supplementary File SI) 
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Figure 1 Schematic representation of the SOFA-HDV-Rz and the HIV-1 RNA region used to identify SOFA-HDV-Rz target sites, (a) 

The SOFA-HDV-Rz is illustrated in both its OFF and OA/ conformations. In the OFF conformation, the SOFA blocker (Bl) base pairs with the 
last four nucleotides (nts) of the recognition domain (RD). When the SOFA biosensor (Bs) base pairs with a specific target sequence, the RD 
is released from the Bl sequence and binds at three to five nts upstream from the Bs binding site in the OA/ conformation. The first nt in the 
target site (n + 1) must be a G, forming a wobble base pair with the RD U. The cleavage site is indicated with an arrow and the nt 0^^, which 
can be mutated to disable the catalytic activity of the SOFA-HDV-Rz, is shown as a circle in the Rz backbone, (b) The full-length genomic (g), 
singly-spliced (ss), and doubly-spliced (ds) RNA species of HIV-1 are illustrated. Reading frames for all HIV-1 proteins are shown above the 
different RNAs and the 5' region, used to identify SOFA-HDV-Rz target sites, is underlined. 
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Figure 2 SOFA-HDV-Rz target site identification, (a) Criteria used to identify SOFA-HDV-Rz target sites in HIV-1 RNA based on our 
conservation estimates at the nucleotide (nt) level are illustrated. The number of nts between the RD and the Bs (spacer, 3-5), and the length 
of the Bs (9-1 1 ), were adjusted to avoid poorly conserved positions or to reduce potential off-target effects on human RNAs. A spacer of 4 nt 
and Bs length of 10 nt were used as the default positioning, (b) Target sites were excluded if they were not identical in HIV-1 NL4-3 or if the 
corresponding Rz had potential target sites in human RNAs using a cut-off score of 20 in the Ribosubstrates tool.^^ (c) Sequence conservation 
estimates in the 5' region of HIV-1 RNA are shown for each nt position in HIV-1 NL4-3 with the selected Rz binding sites shaded in grey. SOFA- 
HDV-Rzs were named according to the first nt in their binding site. The dashed line represents the separation between the 5'UTR and Gag 
ORF SOFA-HDV-Rz target sites that were moderately conserved, but did not meet our conservation criteria, are indicated with an asterisk (*). 



using all connplete sequences available in the Los Alannos 
National Laboratory (LANL) database (1850 at the tinne of 
analysis, subtype distribution shown in Supplementary 
Figure S1). These estimates were used to identify highly 
conserved SOFA-HDV-Rz target sites (Figure 2a) that were 



identical in HIV-1 strain NL4-3 (Figure 2b). The Ribosub- 
strates informatics tooP^ was used to exclude SOFA-HDV- 
Rzs targeting 12 highly conserved and 19 moderately 
conserved regions in HIV-1 RNA, due to their potential to 
target human RNAs. 
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Figure 3 Inhibition of HIV-1 production by SOFA-HDV-Rzs. HEK293T cells were seeded in 24-well plates and cotransfected with HIV-1 
pNL4-3 plasmid DNA (75 ng) and one of the indicated psiRNA SOFA-HDV-Rz expression plasmids (750 ng). Viral production was estimated 
48 hours following transfection by measuring the activity of HIV-1 RT in culture supernatants. Each replicate was expressed as a percentage 
of the value obtained for cotransfection with the empty Rz expression plasmid tested in parallel (Relative RT activity). Rzs were evaluated in 
at least three independent experiments with one to three replicate transfections, data are expressed as the mean ± SEM (n = 5-10). Graph 
Pad Prism was used to calculate P values for the effects of each HIV-1 specific SOFA-HDV-Rz compared to the irrelevant control (-RzHBV). 
Results from un-paired t-tests are shown above each SOFA-HDV-Rz that demonstrated a significant inhibition of viral production compared 
to the control (*P<0.05, **P< 0.01 , ***P< 0.001). 



Consistent with previous studies, several highly con- 
served target sites were identified in the 5'LTR U5 region 
within the 5'UTR (Figure 2c). The Gag coding sequence 
had nnuch lower overall conservation; however, four highly 
conserved and, with some exceptions to our conservation 
criteria (Figure 2a), nine moderately conserved target sites 
were identified in this region (Figure 2c). Of the Rzs that 
we have previously evaluated targeting the Tat/Rev exon1 
coding sequence of HIV-1 RNA,^^ the target sites for Tat1 
and Tev1 were highly and moderately conserved, respec- 
tively. Conservation exceptions and sequences of all target 
sites used in this study are illustrated, along with the DNA 
sequences of the corresponding SOFA-HDV-Rz variable 
regions, in Supplementary Table S1. 



SOFA-HDV Rz screen for inhibition of HIV-1 production 

The effect of each SOFA-HDV-Rz expressing plasmid on 
HIV-1 production was evaluated by cotransfection with 
HIV-1 molecular clone pNL4-3 in HEK293T cells, using 
conditions similar to those reported for other Rzs^'^^'^^ and 
shRNAs.2^'22 HIV-1 RT activity was measured to estimate 
the production of virus released into the medium of trans- 
fected cells and effects of Rzs were normalized to cotrans- 
fection of pNL4-3 with an empty Rz expression plasmid. 
An irrelevant Rz targeting Hepatitis B Virus RNA (SOFA- 
HDV-RzHBV, adapted from SOFA-5Rz-3039) was used as a 
negative control and previously described SOFA-HDV-Rzs 
Tati and Tevl^^ were used as positive controls. Compared 
to SOFA-HDV-RzHBV, Rzs targeting both the 5'UTR and 
Gag coding sequences significantly inhibited viral produc- 
tion, with the top candidate (SOFA-HDV-Rz-1498) targeting 
the Gag coding sequence (Figure 3). 



Antisense and mismatched variants of SOFA-H DV-Rz1 498 
are not effective inhibitors of HIV-1 production 

To evaluate the antisense effect of SOFA-HDV-Rz1498, 
we generated an inactive variant (SOFA-HDV-Rz1498A76) 
in which a C to A mutation at position 76 in its backbone 
disables its cleaving capability^^ (Figure 4a). SOFA-HDV- 
Rz1498A76 did not significantly inhibit HIV-1 production at 
similar expression levels to SOFA-HDV-Rz1498 (Figure 4b), 
suggesting that Rz catalytic cleavage is primarily responsible 
for the effects of SOFA-HDV-Rz1498. No effect on the infec- 
tivity of virus from SOFA-HDV-Rz 1498 expressing cells was 
observed (Supplementary Figure S2), suggesting that the 
Rz reduces the amount of virus produced but does not affect 
the quality of the virions. 

SOFA-HDV-Rz1498 variants with either a single or double 
mutation in their biosensor (Bs) sequence were also gener- 
ated to evaluate the potential for SOFA-HDV-Rz1498 to toler- 
ate mismatches with its target (SOFA-HDV-Rz1498Bs1 and 
SOFA-HDV-Rz1498Bs2, Figure 4a). Neither variant inhibited 
HIV-1 production (Figure 4b), suggesting that the effect of 
SOFA-HDV-Rz 1498 is highly sensitive to mismatches with its 
target. The mismatched Rzs had similar in wfro cleavage rate 
constants (k^^^) with significantly reduced maximum cleavage 
(^max) values (Figure 4c), suggesting that part of their fail- 
ure to inhibit HIV-1 production in cells is related to a reduced 
capacity to cleave their target. 

An shRNA targeting the 1498 site is a potent inhibitor 
of HIV-1 production and provides an additive effect in 
combination with SOFA-HDV-Rz1498 

To evaluate the potential for other antisense molecules target- 
ing the SOFA-HDV-Rz1498 target site to inhibit HIV-1 produc- 
tion, we designed shRNA1498. According to our conservation 
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estimates, each nt in the shRNM 498 target site was conserved 
at >80% (Figure 5a, Gag). Compared to a nonsense shRNA 
(shRNAns) as well as SOFA-HDV-Rz1498 and its controls, 
shRNA1498 provided a near complete inhibition of viral pro- 
duction (Figure 5b). This inhibition correlated with a decrease 
in intracellular expression of the HIV-1 Gag polyprotein and 
one of its processing products, capsid (CA). Unexpectedly, 
the decrease in CA expression was much more pronounced 
for both the Rz and shRNA compared to their effects on Gag 
expression. A similar effect was observed for shRNAs target- 
ing sequences in the 5'UTR and Tat/Rev coding sequences of 
HIV-1 RNA (Supplementary Figure S3), suggesting that it is 
not specific to an shRNA targeting the Gag 1498 sequence. 

To compare the potency of shRNA1498 to other candidate 
ShRNAs, we designed shRNA522 and shRNA553, modeled 
after previously characterized siRNAs^° and shRNAs^^ target- 
ing the 5'UTR (Figure 5a, 5'UTR) and shRNA5983, modeled 
after a construct in clinical development targeting the Tat/ 
Rev exoni coding sequence^^ (Figure 5a, Tat/Rev). All four 
shRNAs inhibited HIV-1 production (Figure 5c). The potency 
of shRNA1498 was comparable to that of shRNA553 and 
shRNA5983, with 50% effective concentrations (ECSOs) for 
shRNA plasmids below 5 ng of input DNA, whereas shRNA522 
was much less potent with an EC50 value of 702 ng. 

To evaluate the potential for shRNA1498 and shRNA5983 
to be used in combination with SO FA- HDV- Rzl 498, we 
cotransfected HEK293T cells with HIV-1 pNL4-3 and different 
combinations of Rzs and shRNAs (Figure 5d). To quantify the 
effect of the combinations, we chose an input level of shRNA 
DNA that did not completely inhibit viral production in Figure 
5c. In combination with both shRNA1498 and shRNA5983, 
SOFA-HDV-Rz1498 provided an additional inhibition of HIV-1 
production compared to the control Rz, SOFA-HDV-Rz-HBV. 
The level of inhibition was similar to its effect when cotrans- 
fected alone (50%) (Figure 5b), suggesting that the Rz can 
provide an additive effect in combination with both an shRNA 
targeting the same site (shRNA1498) and an shRNA target- 
ing a different site (shRNA5983). 



Figure 4 Effects of SOFA-HDV-Rz1498 variants on HIV-1 
production, (a) Schematic representation of the SOFA-HDV-Rz1498 
target site (T) and variants (Rz). SOFA-HDV-Rz1498A76 has a 0 to 
A mutation in the Rz backbone, -Rz1498Bs1 and -Rz1498Bs2 have 
one or two nucleotide (nt) variants in the biosensor (Bs), indicated in 
lower case, (b) Effects of each SO FA- HDV- Rzl 498 variant on viral 
production in HEK293T cells were evaluated exactly as in Figure 3. 
Rzs were evaluated in at least three independent experiments with 
one to three replicate transfections (reported as mean ± SEM, n = 
6-10). The relative expression of Rz and 5S RNA loading control for 
the different conditions are shown below for one of two independent 
experiments performed in HEK293T cells seeded in a 12-well 
plate and cotransfected with twice the amount of DNA used for the 
evaluation of viral production in 24-well plates, (c) Single turnover in 
vitro cleavage activities for S0FA-HDV-Rz1 498, -Rz1 498Bs1 , and 
-Rz1498Bs2 were determined at different incubation times with a 
small substrate RNA (Rz»substrate). Cleavage % was measured 
by dividing cleaved products by cleaved + uncleaved products, 
quantified from bands on a gel. A nonlinear regression one phase 
exponential association equation with least squares (ordinary) fit 
was determined using Graph Pad Prism for the different Rzs. All data 
points represent two independent experiments and are reported as 
mean ± SEM (n = 2). The average rate constants (l<^^^) and maximum 
cleavage values (F^^J for the SOFA-HDV-Rzs are reported in a table. 



SOFA-HDV-RZ1498 and shRNA1498 inhibit viral 
production from diverse HIV-1 strains 

As the Gag 1498 target site was accessible to inhibition by 
both a Rz and an shRNA in HIV-1 strain NL4-3, we next 
evaluated whether this inhibition extended to diverse viral 
strains representing subtype B (MaP° and AD82^), C (Indie- 
01^2 and MJ433), D (94UG11434) and circulating recombinant 
form (CRF) 02_AG (97GH-AG135). SO FA- HDV- Rzl 498 inhib- 
ited HIV-1 production from viral strains (Mai, AD8, MJ4, and 
97GH-AG1) with nt variants in proximity to their target sites 
compared to NL4-3 (Figure 6), suggesting that the struc- 
ture of the target site is equally accessible to the Rz in these 
strains. Consistent with results using SOFA-HDV-Rz binding 
site variants (Figure 4, -Bsl and -Bs2), SO FA- HDV- Rzl 498 
did not inhibit HIV-1 production from the strains Indie-CI and 
94UG114, which harbor a single nt variant within their Bs 
binding sites (Figure 6). In contrast, shRNA1498 inhibited 
HIV-1 production from all strains suggesting that it can toler- 
ate a single nt mismatch in its binding site at position 17 and 
can inhibit HIV-1 production in diverse strains. 
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Figure 5 Effects of shRNA1498 on HIV-1 production (a) Sequences targeted by shRNA1498 and control shRNAs targeting HIV-1 RNA 
(shRNA522, shRNA553, and shRNA5983) are shown in relation to our conservation estimates at the nt level from Supplementary File S1 . 

(b) Effects of shRNAI 498 and a nonsense shRNA (shRNAns) on viral production in HEK293T cells were evaluated exactly as in Figure 3. Rzs 
and shRNAs were evaluated in at least three independent experiments with one to three replicate transfections (reported as mean ± SEM, n 
= 6-10). The relative intracellular expression of HIV-1 Gag polyprotein (Gag, p55), matrix-capsid intermediate (MA-CA, p39), and capsid (CA, 
p24) proteins as well as GAPDH loading control are shown below for one of two independent experiments performed in HEK293T cells seeded 
in a 12-well plate and cotransfected with twice the amount of DNA used for the evaluation of viral production. Relative band intensities for Gag 
and CA were calculated using Image J software and are expressed as a fraction of the intensity of Gag in the SOFA-HDV-RzHBV control lane. 

(c) The potency of shRNAs was evaluated by cotransfecting HEK293T cells seeded in a 24-well plate with 100ng of pNL4-3 DNA and 1-750ng 
of shRNA expressing plasmids. For lower amounts of shRNA plasmid DNA (1-500ng), cotransfections were topped up to 850 ng total DNA by 
the addition of an irrelevant plasmid (pBluescript SK+, Stratagene, La Jolla, CA). Relative RT activity measurements were log transformed and 
a nonlinear regression log(inhibitor) versus response equation with least squares (ordinary) fit was determined using Graph Pad Prism for the 
different shRNAs. EC50 values from this equation are mentioned in the text. All data-points represent at least two independent experiments 
with two to three replicates and are reported as mean ± SEM (n = 4-8). (d) Combinations of SOFA-HDV-Rz and shRNA expressing plasmids 
were evaluated in HEK293T cells seeded in 24-well plates and cotransfected with lOOng pNL4-3, lOng of shRNA expressing plasmid and 1 
|jg of Rz expressing plasmid. Data were normalized to cotransfection of 1 00 ng pNL4-3 with 1 |jg of the empty Rz/shRNA expression plasmid 
and are reported as the mean ± SEM from two independent experiments performed in triplicate (n = 6). 



SOFA-HDV-RZ1498 and shRNA1498 have minimal off- 
target effects on liuman RNAs 

The potential for both SOFA-HDV-Rz 1498 and shRNAI 498 
to affect the expression of hunnan nnRNAs was next evalu- 
ated in HEK293T cells cotransfected with HIV-1 pNL4-3. 



Prior to gene expression profiling, the inhibition of viral 
production was confirnned for each condition (data not 
shown) and agreed with results presented in Figure 5b. 
Microarray experinnents were perfornned as triplicate dye 
swaps and the results were expressed as the log2 ratio of 
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Figure 6 Inhibition of HIV-1 production from diverse viral strains by SOFA-HDV-Rz1498 and shRNA1498. (a) The sequence in and 
around the shRNA1498 and SOFA-HDV-Rz1498 target site is shown for HIV-1 NL4-3 (Ml 9921), MAL (K03456), ADS (AF004394), Indie-CI 
(AB023804.1), MJ4 (AF321523), 94UG114 (U88824.1), and 97GH-AG1 (AB04981 1 .1). The overlapping target site for shRNA1498 and 
SOFA-HDV-Rz1498 is highlighted in blue, with both the RD (7 nucleotide (nt)) and Bs (10 nt) binding sites underlined. Nt variations compared 
to HIV-1 NL4-3 are highlighted in red and the start and end positions for each sequence are shown according to their annotation in Genbank. 
(b) Effects of SOFA-HDV-Rzl 498, shRNAI 498 and their controls on HIV-1 production from different molecular clones in HEK293T cells were 
evaluated exactly as in Figure 3, with at least two independent experiments, each including two to three replicate transfections (reported as 
mean±SEM, a7 = 4-10). 



SOFA-HDV-Rzl 498 or shRNAI 498 connpared to the ennpty 
vector cotransfected cells (Supplementary Figure S4). All 
average log2 ratios were low (below 1 .0) suggesting that 
both SOFA-HDV-Rzl 498 and shRNAI 498 can inhibit HIV-1 
production with minimal effects on human mRNA expres- 
sion. The log2 ratios for mRNAs with the greatest extent of 
up- or downregulation are illustrated in Figure 7 and listed 
in Supplementary File S2. Several of these mRNAs were 
found in both SOFA-HDV-Rzl 498 and shRNAI 498 condi- 
tions, suggesting that part of the observed changes may be 
target site specific. 

SOFA-HDV-Rzl 498 and shRNAI 498 
replication in aT lymphocyte cell line 

To evaluate the potential for SOFA-HDV-Rzl 498 and 
shRNAI 498 to inhibit HIV-1 replication, we transfected Jur- 
kat T cells with the same constructs used for their delivery 
to HEK293T cells and selected stably transfected cells in 
the presence of Zeocin. All cell lines had a similar distribu- 
tion of GFP expression from the integrated plasmids and 
proliferated at similar levels (Supplementary Figure S5). 
Following infection with HIV-1 NL4-3, both SOFA-HDV- 
Rzl 498 and shRNAI 498 expressing cells had a lower 
level of viral replication compared to cells expressing 
the control molecules, SO FA- HDV- RzHBV and shRNAns 
(Figure 8). A moderate suppression of viral replication was 
also observed in cells expressing the catalytically inactive 
Rz (SOFA-HDV-Rzl 498A76). These results suggest that in 
addition to its accessibility in HEK293T cells, the Gag 1498 



target site is also accessible to inhibition by both a SOFA- 
HDV-Rz and an shRNA in a T lymphocytic cell line. 

Discussion 

Rzs targeting HIV-1 RNA were among the first gene therapy 
agents tested in a clinical setting, and the only agent tested 
in a phase II vector controlled trial.^^ While no toxicity has 
been observed in this study, the moderate effect reported 
emphasizes the need to find more inhibitory molecules for 
use in gene therapy. Approaches that have been described 
to identify new Rz candidates include the use of (i) RNA 
Polymerase III (Pol III) promoters to achieve higher levels of 
inhibit HIV-1 expression, ^^'^^ (ii) cellular screens to identify optimal target 
sites, ^^'^^ (iii) chimeric Rzs to enhance anti-viral effects, ^^'^^'^^'^^ 
and (iv) alternative motifs such as modified RNase P and 
HDV Rzs.^'^^ In this study, we screened SOFA-HDV-Rzs, 
expressed from the RNA Pol III HI promoter, targeting the 
5'UTR and Gag coding sequence of HIV-1 RNA for effects on 
viral production. Several candidate Rzs with the potential to 
target a broad range of HIV-1 strains were identified (Figure 
2) and one of these (SOFA-HDV-Rzl 498) was a particularly 
effective inhibitor of HIV-1 production (Figure 3). In agree- 
ment with other Rz screens,^^"^^ where only a small number 
of Rzs out of a library of candidates were effective, the major- 
ity of SOFA-HDV-Rzs evaluated did not strongly inhibit HIV-1 
production. One explanation for this could be that Rzs may be 
particularly sensitive to the accessibility of their target sites 
and that most target sites in HIV-1 RNA do not permit Rzs to 
bind and form the appropriate structure required for catalysis. 
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Figure 7 Gene expression changes in cells transfected with 
SOFA-HDV-RZ1498 and shRNA1498. HEK293T cells were seeded 
in 12-well plates and cotransfected with HIV-1 pNL4-3 (150ng) and 
either SOFA-HDV-Rz1498, shRNA1498 or the empty Rz/shRNA 
expression plasmid (1.5 |jg). Forty-eight hours after transfection, 
total RNA was harvested and analyzed by micro-array. The log2 
ratios of mRNAs with the greatest differential variation between 
SOFA-HDV-Rz1498 or shRNA1498 transfected cells and the empty 
expression vector transfected cells are shown. RNAs that were 
up- or downregulated in both conditions are shown in green and 
red respectively. One gene that was downregulated by SOFA-HDV- 
Rz1498 and upregulated by shRNA1498 is shown in yellow. The 
gene identities and log2 ratio values are provided in Supplementary 
File S2. 



Other explanations include a failure of nnost Rzs to fold cor- 
rectly or to localize with their target RNA. 

According to the secondary structure published for the 
full length genonnic RNA of HIV-1 NL4-3,42 target site we 
identified in the Gag coding sequence lies predonninantly 
in a 27 nt single-stranded loop enunnerated 1033 to 1059 
based on its position in NL4-3 HIV-1 RNA (1487 to 1513 in 
NL4-3 DNA). In addition to being accessible to inhibition by 
a SOFA-HDV-Rz, the 1498 target site was also accessible 
to an shRNA, with sinnilar potency in an HIV-1 production 
assay connpared to previously described shRNA candi- 
dates (Figure 5c). Although both SOFA-HDV-Rz 1498 and 
shRNA1498 target the sanne site, our results suggested that 
they could be used together to inhibit viral production with 
an additive effect comparable to the connbination of SOFA- 
HDV-Rz1498 with shRNA5983 targeting a different site 
(Figure 5d). These results are connpatible with SOFA-HDV- 
Rz1498 and shRNA1498 acting in different cellular connpart- 
nnents to provide a nonredundant effect on HIV-1 production. 
Consistent with this, Rzs expressed fronn the RNA Pol III 
pronnoter U6 were shown to localize predonninantly in the 
nucleus, 2^ while shRNAs are exported to the cytoplasnn 
through the exportin 5-dependent pathway. 



Figure 8 Inhibition of HIV-1 replication by SOFA-HDV-Rz1498 
and shRNA1498. (a) Jurkat T cells stably transfected with the 
indicated SOFA-HDV-Rzs and shRNAs were infected with HIV-1 
NL4-3. The average RT activity (cpm/|jl) in culture supernatants 
across four independent infections performed in triplicate (n= 12) 
is shown for days 1 0 and 1 4 following infection, (b) Time course of a 
representative infection (n = 3) followed out to 32 days after infection. 

For both SOFA-HDV-Rz 1498 and shRNA1498, the inhi- 
bition of intracellular OA expression was nnore apparent 
than inhibition of Gag (Figure 5b) and a sinnilar effect was 
observed for shRNAs targeting the 5'UTR and overlapping 
Tat/Rev reading franne (Supplementary Figure S3). It is pos- 
sible that in these experinnents both Rzs and shRNAs inhibit 
intracellular Gag processing to the OA protein, a nnecha- 
nisnn that has been described as contributing to the inhibi- 
tion of HIV-1 production provided by nnodified U1 snRNAs.'^^ 
Another possibility is that intracellular Gag is nnore stable 
connpared to OA in these experinnents, nnaking the effect on 
OA expression nnore apparent at the sannpling tinne evalu- 
ated. Further studies will be required to evaluate whether this 
effect is reproducible in different cell types and experinnental 
conditions and deternnine whether it could be a contributing 
nnechanisnn to the inhibition of viral production provided by 
Rzs and shRNAs targeting HIV-1 RNA. 

An innportant consideration for the design of therapies tar- 
geting HIV-1 is that they target the nnajority of HIV-1 circulat- 
ing strains. According to our estinnates, the target site for both 
SOFA-HDV-Rz-1498 and shRNA1498 was highly conserved 
at the nt level (Figure 5a, Gag). The LANL dataset we used 
to generate our estinnates was nnade up of 1850 connplete 
HIV-1 sequences, connpared to 170 and 495 in earlier stud- 
ies that used this dataset to identify shRNA^^ or siRNA^^ target 
sites, respectively A connparison of the subtype distribution 
between the LANL dataset we used and global estinnates"^"^ 
(Supplementary Figure S1) suggests that HIV-1 subtype 0 
is under-represented in the LANL dataset while subtype B, 
unique reconnbinant fornns (URFs) and CRFs other than AG 
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or AE are all over-represented. While not exactly representa- 
tive of the global distribution, the LANL dataset includes all 
published HIV-1 sequences with representatives from all major 
subtypes and circulating recombinant forms and can eas- 
ily be used to generate conservation estimates at the nt level 
for the identification of antisense target sites."^^ In addition to 
being highly conserved across the different sequences avail- 
able in the LANL database, our results suggest that the 1498 
target site may also have a conserved structure, as the effects 
of shRNA1498 were similar against diverse viral strains with 
sequence variation within and around the target site (Figure 6). 

Another important consideration for the design of antisense 
molecules targeting HIV-1 RNA is their potential to target cel- 
lular RNAs with complementary target sites. While target 
sites can be compared to the human genome, it is not always 
possible to predict off-target effects, as many antisense tech- 
nologies do not require perfectly matched target sites. In 
agreement with in vitro studies,^'^° a single nt mismatch in 
SOFA-HDV-Rz1498 (Figure 4) or its binding site (strain Indie- 
C1 and 94UG114, Figure 6) greatly affected its ability to 
inhibit HIV-1 production. In contrast, an imperfectly matched 
snoRNA linked hammerhead Rz targeting HIV-1 RNA was an 
effective inhibitor of HIV-1 productions^ and several shRNAs 
have been shown to tolerate single and even double mis- 
matches in HIV-1 RNA.2^'4^ Consistent with previous results,^^ 
a single nt mismatch at position 17 did not affect the inhibition 
of viral production provided by shRNA1498 (strain Indie-CI 
and 94UG114, Figure 6) suggesting that SOFA-HDV-Rzs 
are more sensitive to mismatches in their target sites com- 
pared to shRNAs. While this sensitivity would be beneficial 
in terms of limiting off-target effects, it is a liability for target- 
ing a broader range of HIV-1 circulating strains and would 
render SOFA-HDV-Rzs more sensitive to the emergence of 
resistant virus in a therapeutic setting. At levels where both 
SOFA-HDV-Rz1498 and shRNA1498 inhibited HIV-1 produc- 
tion, only small changes in gene expression were observed 
(Figure 7, Supplementary Figure S4), suggesting that 
their target site may be sufficiently different from human 
RNA sequences to avoid major off-target effects on human 
mRNAs. Combined with its potency and effects on diverse 
HIV-1 strains, in addition to our predictions for sequence con- 
servation, these results suggest that shRNA1498 has the 
potential to be used safely and effectively against a broad 
range of HIV-1 circulating strains in a therapeutic setting. 
Future studies to directly compare the safety and efficacy of 
shRNA1498 to other shRNA candidates in more relevant cell 
lines and HIV-1 infection models are planned. 

Although SOFA-HDV-Rz1498 was more effective than our 
previously designed candidates targeting the Tat/Rev region 
of HIV-1 RNA it reached only a 50% inhibition of viral pro- 
duction in cotransfection assays compared to an almost 
complete inhibition at much lower levels by shRNA1498 
(Figure 5b, c). In contrast, it showed a similar potential to sup- 
port a sustained inhibition of viral replication in JurkatT cells 
(Figure 8), suggesting a long-term persistence of its effects 
in an infection model. Improvements in the design or expres- 
sion of SOFA-HDV-Rz1498 may further increase its efficacy 
and modifications to render it less sensitive to mutations in 
its target site are being considered. Overall, our results sug- 
gest that both SOFA-HDV-Rz-1498 and shRNA1498 could 



be further developed for expression in a gene therapy setting 
or by direct delivery with newly developed methods'^ as RNA 
drugs. 



IVIaterials and methods 

In silico identification of SOFA-HDV-Rz target sites. The use 
of the LANL dataset to estimate sequence conservation at 
the nt level has been previously described. Briefly, a multiple 
sequence alignment of all complete HIV-1 sequences were 
downloaded from the LANL database using the QuickAlign 
tool and a consensus sequence with % conservation at each 
position was generated using Jalview sequence editor^^ and 
exported to Microsoft Excel (Supplementary File S1 , raw data 
tab). Several positions in the consensus sequence were repre- 
sented in only a small number of sequences and positions that 
occurred in less than 1 0% of the sequences were removed from 
the raw data to facilitate target site selection (Supplementary 
File S1, conservation estimates tab). Highly conserved target 
sites were selected based on criteria illustrated in Figure 2a. 
Nucleotide BLASP^ was then used to align target sites to HIV-1 
NL4-3 (Ml 9921) and Ri bos ubst rates software (http://www.ribo- 
club.org/ribosubstrates) was used to evaluate the potential for 
the corresponding SOFA-HDV-Rzs to target human RNAs as 
previously described.^^ Briefly the software identifies potential 
target sites for SOFA-HDV-Rzs in a cDNA database allowing 
for variations in the length of the spacer sequence and biosen- 
sor (Figure 1a). Perfectly matched target sites are assigned a 
value of 0, and the score increases by 1 0 for each wobble base 
pair and by 1 00 for each mismatch. We set a value of 20 as the 
cut-off for potential off-target effects, representing at least two 
wobble base pairs between a SOFA-HDV-Rz targeting HIV-1 
RNA and a potential target site in any human RNA. 

Plasmid construction. All SOFA-HDV Rzs and shRNAs were 
expressed from the human RNaseP HI promoter in the vec- 
tor psiRNA-H1GFP::Zeo (InvivoGen, San Diego, CA). SOFA- 
HDV-Rz inserts were generated using an overlapping PCR 
strategy^ ^'^"^'^^ and shRNA inserts were generated by annealing 
complementary oligonucleotides. Sequences for shRNA522 
and shRNA553 inserts were obtained from a previous study^^ 
and using an identical design, shRNAns (nonsense, adapted 
from siControP), shRNA5983 (adapted from shl^^) and 
shRNAI 498 inserts were designed. The cloning strategies and 
sequences of all oligonucleotides used for the generation of 
plasmid inserts are provided in the Supplementary Materials 
and Methods, variable SOFA-HDV-Rz DNA sequences are 
illustrated in Supplementary Table S1. 

Transfections. Cotransfections of HIV-1 molecular clones 
with Rz or shRNA expressing plasmids were performed 
in either 24- or 12-well plate formats as indicated in the 
figure legends. Cell culture conditions are described in the 
Supplementary Materials and Methods. Twenty-four hours 
prior to transfection, HEK293T cells were plated at 2x10^ 
cells/ml and transfections were carried out using TransIT 
reagent (Mirus, Madison, Wl) according to the manufac- 
turer's instructions. Viral production was estimated 48 hours 
following transfection by measuring HIV-1 RT activity in the 
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culture supernatant. To account for differences in viral pro- 
duction between experiments, replicates for each construct 
evaluated were performed in parallel with the empty vector 
psiRNA-H1GFP::Zeo and all data are expressed as a per- 
centage of viral production in the empty vector cotransfected 
cells (Relative RT activity). For each Rz or shRNA construct 
evaluated, we also included the irrelevant control SOFA-HDV- 
Rz-HBV or the nonsense control shRNAns, respectively. 

Rz expression in HEK293T cells. Total RNA extracts were 
harvested from transfected cells using Trizol reagent (Life 
Technologies, Carlsbad, CA) according to the manufacturer's 
instructions. Ten microgram of total RNA was resolved on 
an 8% denaturing polyacrylamide gel, transferred to a nylon 
membrane (AmershamHybond-N+, GE Healthcare, Little 
Chalfont, UK) and UV cross-linked. Membranes were incu- 
bated with ProbeSOFA followed by ProbeSS and visualized 
using a Phosphor screen. Probe sequences, labeling and 
details on the northern blot conditions are provided in the 
Supplementary Materials and Methods. 

In vitro SOFA-HDV-Rz cleavage assay. Single-turnover con- 
ditions (Rz»substrate) were used to evaluate the catalytic 
activity of SOFA-HDV-Rzs as previously described.^^ Briefly 
a trace amount of 5'-end-labeled substrate (<1 nmol/l) was 
incubated at 37 °C with a final concentration of 100 nmol/l 
of the selected SOFA-HDV-Rz. The cleavage reactions were 
initiated by the addition of MgCl2 and samples were taken at 
different time intervals and stopped with loading buffer. Recov- 
ered samples were resolved on a 20% denaturing polyacryl- 
amide gel, visualized using a Phosphor Screen and quantified 
using ImageQuant software (Molecular Dynamics, Sunnyvale, 
CA). The control reaction was performed in the absence of 
Rzs (replaced by water) and its last time interval sample was 
used to subtract the background. For each time point, the per- 
centage of cleavage was calculated (cleaved product counts 
over cleaved + uncleaved products counts). Details on the 
DNA templates used for in vitro transcription, RNA synthesis 
and labeling are provided in the Supplementary Materials 
and Methods. The k ^ and F were then calculated using 

obs max ^ 

GraphPad Prism 5 for each Rz.The rate of cleavage (k^^J was 
obtained by fitting the data to the equation = F^^^ (1 - e-''^, 
where F^ is the percentage of cleavage at time t, F^^^ is the 
maximum percent cleavage and Zeis the rate constant {k^^)- 

HIV-1 protein expression in HEK293T cells. The detection 
of HIV-1 protein expression using an HIV-1 p24 antibody in 
HEK293T cells has been previously described. ^° Briefly, 100 
|jg of total protein was resolved on a 10% denaturing poly- 
acrylamide gel and transferred to a Hybond ECL nitrocellulose 
membrane (GE Healthcare, Little Chalfont, UK). Membranes 
were incubated first with anti-HIVp24 (183-H12-5C) followed 
by anti-GAPDH (sc-32233, Santa Cruz Biotechnology, Dal- 
las, TX), bands were visualized using ECL (GE Healthcare). 
The relative intensity of bands were calculated using Image 
J densitometry software (Version 1.48, National Institutes of 
Health, Bethesda, MD). Data are expressed as Gag or CA 
band intensities relative to the intensity of the Gag band 
in the control SOFA-HDV-RzHBV(Figure 5b) or shRNAns 
(Supplementary Figure S3) lanes. 



Gene expression profiling in HEK293T cells. Gene expres- 
sion levels relative to control transfections were analyzed 
by human gene expression microarrays (ACRI proprietary 
slides). The triplicate dye-swap experiments are described in 
the Supplementary Materials and Methods. 

HIV-1 infection assay. Stable JurkatT lymphocytes were gen- 
erated by electroporation of psiRNA constructs followed by 
selection with Zeocin (InvivoGen, San Diego, CA). The rela- 
tive expression of psiRNA constructs in the stable cell popula- 
tions was estimated by measuring GFP expression from the 
integrated vector with a FACS Calibur flow cytometer (BD Bio- 
sciences, San Jose, CA), and proliferation was determined by 
counting live cells by Trypan blue (Wisent, St Bruno, Canada) 
exclusion using a hemocytometer. Prior to infection, SOFA- 
HDV-Rz or shRNA expressing cells were plated in six-well 
plates at 2x 10^ cells per well. Viral replication was monitored 
by measuring RT activity in culture supernatants at various 
days after infection. Cell culture and selection conditions are 
described in the Supplementary Materials and Methods. 

HIV-1 RT assay. The HIV-1 RT assay used in this study was 
performed as previously described.^^ Briefly, 5 |jl of superna- 
tant was incubated with a polyadenylic acid template (Roche, 
Basel, Switzerland), an oligodT primer (Life Technologies) 
and p2p]-dTTP (3,000 Ci/mmol, Perkin Elmer, Waltham, MA) 
for 2 hours at 37 °C in 50 |jl total reaction mixture. Five micro- 
liter of the reaction mixture was then spotted onto Diethyl- 
aminoethyl (DEAE) filter mat (Perkin Elmer, Waltham, MA) 
and washed five times in 2X SSC buffer, followed by two 
washes in 95% ethanol to remove P^PJ-dTTP not incorpo- 
rated into the poly dT RT product. Counts per minute (cpm) 
were calculated for each sample using a microplate scintil- 
lation counter (MicrobetaTriLux, Perkin Elmer) and are pro- 
portional to the amount of HIV-1 RT enzyme present in the 
reaction mixture. 

Supplementary material 

Figure S1. Subtype distribution of HIV-1 sequences used to 
calculate conservation estimates in comparison to global dis- 
tribution estimates. 

Figure S2. Effect of SOFA-HDV-Rz1498 on the quality of viri- 
ons produced from cotransfected HEK293T cells. 
Figure S3. Gag and Capsid protein expression in cells co- 
transfected with HIV-1 pNL4-3 and different shRNAs target- 
ing HIV-1 RNA. 

Figure S4. Change in mRNA expression ratios compared 
to a control vector for SOFA-HDV-Rz 1498 and shRNA1498 
transfected HEK293T cells. 
Figure S5. Stable Jurkat cell lines analysis. 
Table S1. SOFA-HDV-Rz target sites and DNA coding se- 
quences. 

File S1. Data for sequence conservation estimates. 
File S2. Significantly up- and downregulated gene identities 
and log2 ratios in transfected HEK293T cells for SOFA-HDV- 
Rz1498 or shRNA1498, compared to the empty Rz expres- 
sion plasmid (psiRNA). 
Materials and Methods 
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